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• Premise of the study: Novel microsatellite markers were characterized in the wind-dispersed and dioecious neotropical tree 
Triplaris cumingiana (Polygonaceae) for use in understanding the ecological processes and genetic impacts of pollen- and 
seed-mediated gene flow in tropical forests. 

• Methods and Results: Sixty-two microsatellite primer pairs were screened, from which 12 markers showing five or more alleles 
per locus (range 5-17) were tested on 47 individuals. Observed and expected heterozygosities averaged 0.692 and 0.731, re- 
spectively. Polymorphism information content was between 0.417 and 0.874. Linkage disequilibrium was observed in one of 
the 66 pairwise comparisons between loci. Two loci showed deviation from Hardy- Weinberg equilibrium. An additional 14 
markers exhibiting lower polymorphism were characterized on a smaller number of individuals. 

• Conclusions: These microsatellite markers have high levels of polymorphism and reproducibility and will be useful in studying 
gene flow and population structure in T. cumingiana. 

Key words: gene flow; microsatellite marker; PacBio sequencing platform; single-molecule real-time sequencing; Triplaris 
cumingiana; wind dispersal. 



Triplaris cumingiana Fisch. & C. A. Mey. ex C. A. Mey. 
(Polygonaceae) is a wind-dispersed, dioecious tree species 
found in humid forests of lower Central America and western 
South America (Croat, 1978). It forms an obligate mutualistic 
relationship with the stinging ant Pseudomyrmex triplarinus 
(Croat, 1978) as an antiherbivore defense. Unlike most dioe- 
cious tree species that have inconspicuous unisexual flowers 
(Bawa and Opler, 1975), flower sexual dimorphism is pro- 
nounced in T. cumingiana, which produces bright red bracts 
signaling flowers on female trees during the dry season in Pan- 
ama. The dioecious mating system, which permits unambiguous 
identification of the maternal and paternal contribution to 
seedlings, combined with the ease of sexual determination, 
make T. cumingiana of particular interest for studies of pol- 
len- and seed-mediated gene flow in tropical tree species. To 
investigate the ecological and genetic impacts of pollen and 
seed dispersal in T. cumingiana as compared to tropical trees 
of alternative pollination and dispersal syndromes, we devel- 
oped polymorphic microsatellite markers in T. cumingiana. 
We used single-molecule real-time sequencing (SMRT) im- 
plemented in the PacBio RS platform (Pacific Biosciences, 
Menlo Park, California, USA) because it is capable of gener- 
ating long reads (Wei et al., 2014). 
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METHODS AND RESULTS 

Genome shotgun sequences were obtained using PacBio' s high accuracy mode 
of circular consensus sequencing (Wei et al., 2014). In brief, genomic DNA from 
one reproductive-sized tree of T. cumingiana that grows in the 50-ha Forest Dy- 
namics Plot (FDP) on Barro Colorado Island, Panama (9°10'N, 79°51'W; tag no. 
199017), was used for PacBio 500-bp SMRTbell library preparation. Following 
sonication, DNA fragments averaging 500 bp were ligated with two 55-nucleotide 
hairpin adapters, and then sequenced on the PacBio RS platform using C2 chemis- 
try. Four SMRT cells generated a total of 178,122 circular consensus reads. Quality 
control was performed to remove homopolymer-rich sequences and poor-quality 
portions of individual reads (for details, see Wei et al., 2014). The resulting high- 
quaUty sequences were used for microsatellite searching and primer design in QDD 
version 2.1 (Meglecz et al., 2010). In total, 795 microsatellite loci were obtained, in 
which 686 loci contained pure repeat motifs (524 di-, 143 tri-, 15 tetra-, three penta-, 
and one hexanucleotide repeat motifs). From these pure microsatellites, loci of >9 
repeats with dinucleotide motifs and loci of >7 repeats with other repeat motifs were 
retained for marker validation. This test array was composed of 62 microsatellite 
loci (32 di-, 25 tri-, three tetra-, one penta-, and one hexanucleotide motifs). 

To test these primers, we isolated genomic DNA using a modified cetyltri- 
methylammonium bromide (CTAB) method (Doyle and Doyle, 1987) from ly- 
ophilized leaves of 47 T. cumingiana adult trees growing in the 50-ha FDP 
(voucher no. Perez 1862, Smithsonian Tropical Research Institute herbarium 
[STRI], Panama). After the initial check of primer amplification on three indi- 
viduals, we found 39 primer pairs generated easily interpretable allelic patterns, 
of which 1 1 loci were monomorphic. We then tested the remaining 28 polymor- 
phic markers on another nine individuals, for which two loci showing apparent 
null alleles were excluded from further analyses. Microsatellite loci of >5 alleles 
based on these 12 samples were screened on an additional 35 individuals. The 
8-|iL PGR reactions contained 4 ng DNA, 6.25 nM HEX-labeled or 9.40 nM 
FAM-labeled M13 primer (TGTAAAACGACGGCCAGT), 0.075 |iM M13- 
tagged forward primer, 0.3 jiM reverse primer, 4 mM MgCl2, 4 jiL GoTaq Color- 
less Master Mix (Promega Corporation, Madison, Wisconsin, USA) including 
200 |iM of each dNTP and 1 unit Tag DNA polymerase, and H2O. PCRs were 
carried out using two different thermocycling conditions. For most of the tested 
primers, we used a touchdown protocol (a; Table 1): 94°C for 4 min; 28 cycles of 
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Table 1. Characteristics of 12 polymorphic micro satellite markers developed in Triplaris cumingiana. 



Locus 




Primer sequences (5^-30'' 


Repeat motif 


Allele size range (bp) 


7;(°c)b 


GenBank accession no. 


TRI_01 


F 
R 


GGCTTTAATTCACCATTTAGCC 
TTGCATCCACACCTAGCAAC 


(AAT)8 


337-418 


b 


KF680412 


TRI_07 


F 
R 


GCCTGACATGATCAAATCCTC 
TTTCAATTGTTGACGGGATG 


(ACAT)8 


220-364 


b 


KF680415 


TRI_09 


F 
R 


GAAGTTGGCAGTCGAGGTTC 
CAAGCTCCAAACTCCCTCAG 


(AAAOs 


194-242 


a 


KF680417 


TRI_20 


F 
R 


ATTTGCCATCCGCTACTTG 
CTCATCATACGATGGCGTTC 


(AAG)9 


196-217 


a 


KF680422 


TRI_26 


F 
R 


ATAGCCTCTAGCCCGACCTG 
GGGCTCTTCTGCTAGGGTTC 


(ACATAT)7 


196-238 


a 


KF680426 


TRI_27 


F 
R 


TCCCTCAGACTGTCCAAAGC 
AGCCAATTGATTGGTTTCAAG 


(AAG)n 


154-238 


a 


KF680427 


TRI_31 


F 
R 


GCAAATCATAATTGGGCTTACC 
CTGCCCTAAACGATCTCACC 


(AT)9 


200-224 


b 


KF680430 


TRI_38 


F 
R 


TGGCTTGACTTGTCGATGTG 
CCACAATTTACAAACCACAAAG 


(AT)i2 


109-127 


b 


KF680432 


TRI_40 


F 
R 


TACACGGGAGCTTGATTTCC 
ATAAACCTAGGCACGGAGGC 


(AG)io 


232-254 


a 


KF680433 


TRI_45 


F 
R 


T C AT GAGGGAAGAT GAGT T C G 
AAATAAATTGGGCACGATAGC 


(AG)26 


106-122 


a 


KF680437 


TRI_49 


F 
R 


GTCGGCCTGCTTCTTTCTC 
TGCGACTTGTAACTGCAACG 


(AG)i9 


123-149 


a 


KF680440 


TRI_55 


F 
R 


AACCCTTGACGAGTCATTGC 
C AAT T T GAAGC AAGC T GAGT G 


(AG)n 


288-304 


a 


KF680444 



Note: = annealing temperature. 

^M13 tail (TGTAAAACGACGGCCAGT) added to the 5' end of each forward primer. 

''a = 59°C (decreasing 0.2°C per cycle) in a touchdown PGR protocol; b = 54.5°C in a nontouchdown PGR protocol (see text for details). 



94°C for 30 s, 59°C (decreasing 0.2°C per cycle) for 40 s, and 72°C for 60 s; 10 
cycles of 94°C for 30 s, 53°C for 40 s, and 72°C for 60 s; and a final extension at 
72°C for 10 min. When the above protocol produced weak PGR amplicons, we 
followed a nontouchdown protocol (b; Table 1): 94°C for 4 min; 28 cycles of 
94°C for 30 s, 54.5°C for 40 s, and 72°C for 60 s; 10 cycles of 94°C for 30 s, 
5 1 .5°C for 40 s, and 72°C for 60 s; and a final extension at 72°C for 10 min. PGR 
amplicons were multiplexed by combining one HEX-labeled locus of 1 .6 jiL and 
one FAM4abeled locus of 1 .4 jiL, with 11.5 jiL Hi-Di formamide (Life Tech- 
nologies, Carlsbad, California, USA) and 0.05 |iL GeneScan 500 ROX Size Stan- 
dard (Life Technologies), before loading to a single lane on an ABI 3730 DNA 
Analyzer (Life Technologies). Alleles were called using GeneMarker version 
2.4.1 (SoftGenetics, State College, Pennsylvania, USA). 

We examined marker characteristics including allelic richness, observed 
and expected heterozygosity, and probability of exclusion (PE2, when one 
parent was known; PE3, of a parent pair) using GenAlEx version 6.5 (Peakall 
and Smouse, 2012). Polymorphism information content (PIC) was estimated 
using PowerMarker version 3.25 (Liu and Muse, 2005). Exact tests of Hardy- 
Weinberg equilibrium (HWE) and locus pairwise linkage disequilibrium (LD) 
were conducted in GENEPOP version 4.2.2 (Rousset, 2008). The P values of 
HWE and LD tests were adjusted for multiple comparisons using Holm's cor- 
rection (Holm, 1979). 

We first focused on 12 markers with an average allelic richness of nine (range 
5-17) (Table 2). Observed heterozygosity ranged from 0.426 to 0.936 (mean = 
0.692), and expected heterozygosity was between 0.475 and 0.885 (mean = 
0.731). Locus PIC averaged 0.705, and overall exclusion probability was 0.998 
(with one parent known) and 1.000 (of a parent pair). We observed LD between 
the loci TRI_27 and TRI_31 (Holm's adjusted P = 0.038). Two loci (TRI_26 and 
TRI_38) showed HWE deviation (Holm's adjusted P < 0.007). These 12 micro- 
satellite markers should provide resolution for studying gene flow and genetic 
structure in T. cumingiana. In addition, we provide information on 14 less- 
polymorphic loci (2-4 alleles per locus tested on 12 individuals) (Appendix 1), 
which can be potential candidate markers if more genetic information is required. 



CONCLUSIONS 

We characterized novel microsatellite markers in the dioecious, 
insect-polhnated, wind-dispersed tropical tree T. cumingiana to 



improve understanding of the processes of pollen- and seed- 
mediated gene flow in tropical forests. This will be done in par- 
allel with studies of tree species with alternative pollination and 
dispersal syndromes. These markers will also be useful for 
studying the ecological responses of T. cumingiana (e.g., dis- 
persal, recruitment) to rapid changes in temperature and rainfall 
patterns, as the distribution of this species is associated with 



Table 2. Summary statistics of microsatellite marker polymorphism 
tested on 47 reproductively mature trees of Triplaris cumingiana, 
growing in the 50-ha Forest Dynamics Plot on Barro Colorado Island, 
Panama. 



Locus 


A 


Ho 




PE2 


PE3 


PIC 


TRI_01 


13 


0.915 


0.877 


0.605 


0.909 


0.866 


TRI_07 


17 


0.800 


0.847 


0.554 


0.890 


0.835 


TRI_09 


8 


0.766 


0.740 


0.366 


0.759 


0.717 


TRI_20 


7 


0.830 


0.775 


0.398 


0.769 


0.747 


TRI_26 


8 


0.511* 


0.691 


0.303 


0.700 


0.664 


TRI_27 


15 


0.936 


0.885 


0.626 


0.920 


0.874 


TRI_31 


7 


0.652 


0.790 


0.419 


0.786 


0.762 


TRI_38 


9 


0.511* 


0.825 


0.484 


0.834 


0.804 


TRI_40 


8 


0.660 


0.655 


0.269 


0.670 


0.631 


TRI_45 


6 


0.489 


0.481 


0.127 


0.466 


0.454 


TRI_49 


5 


0.809 


0.736 


0.317 


0.666 


0.688 


TRI_55 


5 


0.426 


0.475 


0.116 


0.384 


0.417 


Mean 


9.0 


0.692 


0.731 


0.998§ 


1.000§ 


0.705 



Note: A = number of alleles per locus; = expected heterozygosity; 
//q = observed heterozygosity; PE2 = probabihty of exclusion with one parent 
known; PE3 = probability of exclusion of a parent pair; PIC = polymorphism 
information content. 

* Significant deviation from Hardy- Weinberg equilibrium after Holm's 
correction (adjusted P < 0.007). 

§ Cumulative probability of exclusion over multiallelic loci. 
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high soil phosphorous content and high dry-season intensity 
(Condit et al, 2013). 
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Appendix 1. Fourteen additional polymorphic microsatellite markers of Triplaris cumingiana screened on 12 individuals sampled from the 50-ha Forest 
Dynamics Plot on Barro Colorado Island, Panama. PCRs follow a touchdown protocol (see text for details). 



Locus 




Primer sequences (5'-30 


Repeat motif 


Allele size range (bp) 


A 


^0 


He 


PIC 


GenBank accession no. 


TRI_06 


F 
R 


CCTTTCCAAACAAGGCTTACC 
GGTCTTGGATCAGCTGAAGG 


(ACT)7 


272-281 


2 


0.083 


0.080 


0.077 


KF680414 


TRI_13 


F 
R 


TGTGTATACCACAAAGCCGAAG 
TCTTCAATCGTTCTGCCTCC 


(AGG)i5 


115-118 


2 


0.667 


0.444 


0.346 


KF680419 


TRI_28 


F 
R 


T C AAAC GAT AC AT TCCATTCTG 
TTGGAATGTTAGGATTGGCG 


(AAT)i4 


108-114 


2 


0.167 


0.444 


0.346 


KF680428 


TRI_30 


F 
R 


AAAGGGAGGAGAAGAAT GGT G 
TCTGCATGGTTGTCTCATAAAC 


(AAT)h 


125-212 


4 


0.250 


0.358 


0.338 


KF680429 


TRI_36 


F 
R 


GGAGTTGACTTGCATTTGGG 
TCATACCCAGTTAACCCATGC 


(AG)9 


163-177 


2 


0.222 


0.444 


0.555 


KF680431 


TRI_44 


F 
R 


TTTAGCCACAATTGCTCAAGAC 
AAAGATCGTCGTTCTCCCAC 


(AT)28 


148-166 


4 


0.250 


0.705 


0.651 


KF680436 


TRI_51 


F 
R 


CATGTACCAAACTGAACCTGTC 
CTCTTGACCGACCGACGAG 


(AC)2o 


147-153 


3 


0.455 


0.368 


0.425 


KF680441 


TRI_52 


F 
R 


TTTCTTGGGTAATTAGTGAGGG 
TAATCCCTGTAGCGTAATCCC 


(AG)23 


115-121 


4 


0.182 


0.380 


0.446 


KF680442 


TRI_54 


F 
R 


GTTTGACCAAGGTTGACCAG 
GGGAAAGAACAAGAAGGAAGG 


(AG)26 


121-123 


2 


0.083 


0.080 


0.077 


KF680443 


TRI_56 


F 
R 


CTAATCGATTGAGGTTCGTGG 
TTGGCAGCAATCTAAGTCCC 


(AG)i5 


138-142 


3 


0.818 


0.661 


0.652 


KF680445 


TRI_57 


F 
R 


CAGCTGCTATTGCTCTCAGC 
TATTTCCAACCAATCTCCCG 


(AG)i4 


147-155 


3 


0.833 


0.517 


0.420 


KF680446 


TRI_58 


F 
R 


G AAC AT C C C AAC AAC AT C C C 
TAGTGGTCGGCAAGCTAGTG 


(AG)i4 


266-268 


2 


0.636 


0.483 


0.471 


KF680447 


TRI_59 


F 
R 


GGTGGATGTGGCAGTGTTAG 
GATCCGAAATTTGCCGTTAC 


(AG)i3 


190-192 


2 


0.667 


0.444 


0.346 


KF680448 


TRI_62 


F 
R 


TAGCGACGGATAAGCTAGGG 
TTATTCTGCCATCACCGCTC 


(AG)„ 


175-187 


3 


0.091 


0.169 


0.281 


KF680450 



Note: A = number of alleles per locus; = expected heterozygosity; = observed heterozygosity; PIC = polymorphism information content. 
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